Inside-Outside Estimation Meets Dynamic EM

نویسنده

  • Detlef Prescher
چکیده

We briefly review the inside-outside and EM algorithm for probabilistic context-free grammars. As a result, we formally prove that inside-outside estimation is a dynamic-programming variant of EM. This is interesting in its own right, but even more when considered in a theoretical context since the wellknown convergence behavior of inside-outside estimation has been confirmed by many experiments but apparently has never been formally proved. However, being a version of EM, inside-outside estimation also inherits the good convergence behavior of EM. Therefore, the as yet imperfect line of argumentation can be transformed into a coherent proof. 1 Inside-Outside Estimation The modern inside-outside algorithm was introduced by [4] who reviewed an algorithm proposed by [1] and extended it to an iterative training method for probabilistic context-free grammars enabling the use of unrestricted free text. In the following, y1 . . . yN are numbered (but unannotated) sentences. Definition: Inside-outside re-estimation formulas for probabilistic context-free grammars in Chomsky normal form are given by (see [4], but see also [1] for the special case N = 1): p̂(A → a) := ∑yN w=y1 Cw(A → a) ∑yN w=y1 Cw(A) , and p̂(A → BC) := ∑yN w=y1 Cw(A → BC) ∑yN w=y1 Cw(A) . The key variables of this definition are so-called category and rule counts: Cw(A) := 1 P ∑n s=1 ∑n t=s e(s, t, A) · f(s, t, A), Cw(A → a) := 1 P ∑ 1≤t≤n, wt=a e(t, t, A) · f(t, t, A), and Cw(A → BC) := 1 P ∑n−1 s=1 ∑n t=s+1 ∑t−1 r=s p(A → BC)e(s, r, B)e(r + 1, t, C)f(s, t, A) which are computed for each sentence w := w1 . . . wn with so-called inside and outside probabilities: An inside probability is defined as the probability of category A generating observations ws . . . wt, i.e. e(s, t, A) := p(A ⇒ ∗ ws . . . wt). In determining a recursive procedure for calculating e, two cases must be considered: • (s = t): Only one observation is emitted and therefore a rule of the form A → ws applies: e(s, s, A) = p(A → ws), if (A → ws) ∈ G (and 0, otherwise). • (s < t): In this case we know that rules of the form A → BC must apply since more than one observation is involved. Thus, e(s, t, A) can be expressed as follows: e(s, t, A) =

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Inside-Outside Algorithm

This note describes the inside-outside algorithm. The inside-outside algorithm has very important applications to statistical models based on context-free grammars. In particular, it is used in EM estimation of probabilistic context-free grammars, and it is used in estimation of discriminative models for context-free parsing. As we will see, the inside-outside algorithm has many similarities to...

متن کامل

Influence of Formulation Parameters on the Release of Diclofenace Sodium from Matrices with Manufacturing Formulation Ingredients

Effects of formulation parameters on the fractional release profile of diclofenac sodium from matrices having the manufacturing formulation ingredients are studied. As a content of cetyl alcohol (rate controlling agent) in the matrix increases, the fractional release decreases. The fractional release increases either by increasing sucrose content outside the granule or by decreasing sucrose...

متن کامل

Patch Enclosure and Localized Effects of Selected Acacia Species on Herbaceous Richness and Soil Properties of Rangelands in Somali Regional State in Ethiopia

Enclosure and Acacia shade availability to plants are basic variable in arid and semi-arid rangelands. the aim of this study was investigation the impact of patch enclosure and Acacia shade using four treatments which are Inside Enclosure Under Acacia shade (IEUA), Inside Enclosure Without Acacia shade (IEWA), Outside Enclosure Under Acacia shade (OEUA) and Outside Enclosure W...

متن کامل

Stochastic Analysis of Lexical and Semantic Enhanced Structural Language Model

In this paper, we present a directed Markov random field model that integrates trigram models, structural language models (SLM) and probabilistic latent semantic analysis (PLSA) for the purpose of statistical language modeling. The SLM is essentially a generalization of shift-reduce probabilistic push-down automata thus more complex and powerful than probabilistic context free grammars (PCFGs)....

متن کامل

In-Between Space, Dialectic of Inside and Outside in Architecture

Defining space by dividing it to inside and outside is one of human’s ways to recognize his positionin environment. Architecture is created to response to this need for inside/outside spaces. Design of inside and outsideSpaces and relation between them always has been one of necessities for definition and limitation of human livingspaces, but little attention to relation of this two spatial rea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/cs/0412016  شماره 

صفحات  -

تاریخ انتشار 2001